Offline Automatic Segmentation based Recognition of Handwritten Arabic Words

نویسندگان

  • Laslo Dinges
  • Ayoub Al-Hamadi
  • Moftah Elzobi
  • Zaher Al Aghbari
  • Hassan Mustafa
چکیده

The world heritage of handwritten Arabic documents is huge however only manual indexing and retrieval techniques of the content of these documents are available. To facilitate an automatic retrieval of such handwritten Arabic document, a number of automatic recognition systems for handwritten Arabic words have been proposed. Nevertheless, these systems suffer from low recognition accuracy due to the peculiarities of the handwritten Arabic language. Thus, in this Paper we propose a segmentation based recognition system for handwritten Arabic words. We divide a handwritten word into smaller pieces of a word and then these small pieces are segmented into candidate letters. These candidate letters are converted into their correspondence chain-code representation. Thereafter we extract discrete, statistical and structural features for classification. Additionally, we introduce a novel active contour based feature to increase the recognition accuracy of strongly deformed Arabic letters. We also use a decision tree to reduce the number of potential classes. We then use a neural network to compute weights for all statistical features and use them as input for a k-NN classifier. Our experiments show that the extracted features by our technique achieve higher recognition accuracy as compared to other features.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Component-based Segmentation of Words from Handwritten Arabic Text

Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words se...

متن کامل

A Survey on Arabic Character Recognition

Off-line recognition of text play a significant role in several application such as the automatic sorting of postal mail or editing old documents. It is the ability of the computer to distinguish characters and words. Automatic off-line recognition of text can be divided into the recognition of printed and handwritten characters. Off-line Arabic handwriting recognition still faces great challen...

متن کامل

A Framework for Arabic Handwritten Recognition Based on Segmentation

Automatic off-line Arabic handwriting recognition still faces a big challenges. Due to the cursive nature of the Arabic language, most of published works are based on recognition of a whole word without segmentation. This paper presents a new framework for the recognition of handwritten Arabic words based on segmentation. This framework involves two phases (training phase and testing phase). In...

متن کامل

Word based off-line handwritten Arabic classification and recognition : design of automatic recognition system for large vocabulary offline handwritten Arabic words using machine learning approaches

......................................................................................................................................... I ACKNOWLEDGMENT .................................................................................................................... III PUBLICATIONS ...............................................................................................................

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011